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ABSTRACT 

We consider the probe of astrophysical signals through radio interferometers with small field 
of view and baselines with non-negligible and constant component in the pointing direction. In 
this context, the visibilities measured essentially identify with a noisy and incomplete Fourier 
coverage of the product of the planar signals with a linear chirp modulation. In light of the 
recent theory of compressed sensing and in the perspective of defining the best possible imag- 
ing techniques for sparse signals, we analyze the related spread spectrum phenomenon and 
suggest its universality relative to the sparsity dictionary. Our results rely both on theoretical 
considerations related to the mutual coherence between the sparsity and sensing dictionaries, 
as well as on numerical simulations. 
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1 INTRODUCTION 

Aperture synthesis in radio interferometry is a powerful technique 
in radio astronomy, allowing observations of the sky with other- 
wise inaccessible angular resolutions and sensiti vities. This tech- 
nique dates back to m ore than sixty ye ar s ago |Rvle & Vonberd 
1 19461: iBlvthd 1 19571; iRvle et all 1 19591 : iRvle & Hewishl Il96dl ; 
iThompson et alj|2004h . In this context, the portion of the celestial 
sphere around the pointing direction tracked by a radio telescope 
array during observation defines the original real signal or image 
probed x. The field of view observed is limited by a primary beam 
A, Standard interferometers are characterized by a small field of 
view, so that the signal and the primary beam are assumed to be 
planar. Considering non-polarized radiation, they respectively read 
as scalar functions x(l) and A(l) of the position I £ R 2 , with 
components (l,m). Each telescope pair at one instant of observa- 
tion identifies a baseline defined as the relative position between 
the two telescopes. To each baseline b\ 6 R 3 , with components 
(u, v, w) in units of the signal emission wavelength A, is associ- 
ated one measurement called visibility. In the simplest setting one 
also considers baselines with negligible component w in the point- 
ing direction of the instrument. Under this additional assumption, 
if the signal is made up of incoherent sources, each visibility corre- 
sponds to the value of the Fourier transform of the signal multiplied 
by the primary beam at a spatial frequency u £ R 2 , identified by 
the components (u, v) of the baseline projection on the plane of the 
signal. Radio-interferometric data are thus identified by incomplete 
and noisy measurements in the Fourier plane. In the perspective of 
the reconstruction of the original image, these data define an ill- 
posed inverse problem. 
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It is well-known that a large variety of natural signals are 
sparse or compressible in multi-scale dictionaries, such as wavelet 
frames. A band-limited signal may be expressed as the N- 
dimensional vector of its values sampled at the Nyquist-Shannon 
rate. By definition, a signal is sparse in some basis if its expansion 
contains only a small number K <C N of non-zero coefficients. 
More generally it is compressible if its expansion only contains a 
small number of significant coefficients, i.e. if a large number of its 
coefficients bear a negligible value. The theory of compressed sens- 
ing demonstrates that, for sparse or compressible signals, a small 
number M < JV of random measurements, in a sensing basis in- 
coherent with the sparsity basis, w ill suffice for an acc u rate and 
stable reconstruction of the signals { Candes etal.ll2006dlbl:ICandesl 
l2006l : lDonohcl2006l : lBa7aniukl2007l : lbonoho & Tannerl2009h . The 
mutual coherence between two bases may be defined as the max- 
imum complex modulus of the scalar product between unit-norm 
vectors of the two bases. Random Fourier measurements of a sig- 
nal sparse in real space are a particular example of a good sensing 
procedure in this context. 

In a very recent work dWiaux et alj|2009h . we presented re- 
sults showing that compressed sensing offers powerful image re- 
construction techniques for radio-interferometric data, in the case 
of baselines with negligible component w. These techniques are 
based on global Basis Pursuit (BP) minimization problems, which 
are solved by convex optimization algorithms. We particularly il- 
lustrated the versatility of the scheme relative to the inclusion of 
specific prior information on the signal in the minimization prob- 
lems. 

In the present work, we raise the important problem of the 
dependence of the image reconstruction quality as a function of 
the sparsity basis, or more generally the sparsity dictionary. The 
larger the typical size of the waveforms constituting the dictio- 
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nary in which the signal is sparse or compressible in real space, 
the smaller their extension in the Fourier plane, and the smaller 
the incoherence between the sparsity and sensing dictionaries. In 
the context of compressed sensing, a loss of incoherence leads to 
a degradation of the reconstruction quality for a given sparsity K 
and a given number M of random measurements. 

The detailed structure of radio-interferometric measurements 
might actually provide a natural response to this issue. The approx- 
imation of baselines with negligible component w is a key assump- 
tion in order to identify visibilities with Fourier measurements of 
the original signal. This approximation actually sets a strong con- 
straint on the field of view probed by the interferometer, requiring 
that it is small enough, not only for the planar approximation of the 
signal but also to neglect the complete effect of the component w 
in the visibilities. We relax this approximation and consider radio 
interferometers with small field of view and baselines with non- 
negligible component w. In this context, each visibility at spatial 
frequency u identifies with the Fourier transform of a complex sig- 
nal obtained as the product of the original planar signal multiplied 

i,i2 

by the primary beam with a linear chirp e 17TW < 1 where the norm 
\l\ identifies the distance to the center of the image. Note that this 
chirp is a priori characterized by a different chirp rate for each base- 
line: w = w(u). In the Fourier plane, the modulation amounts to 
the convolution of the Fourier transform of the chirp with that of the 
signal multiplied by the primary beam, which inevitably spreads the 
two-dimensional sample power spectrum of the constitutive wave- 
forms. This spread spectrum phenomenon increases the incoher- 
ence between the sparsity and sensing dictionaries. 

In this context, we define signals made up of Gaussian wave- 
forms with equal size, identified by a standard deviation t, consid- 
ered as a rough representation of any kind of astrophysical struc- 
ture. We make the simplifying assumption that all baselines, iden- 
tified by the spatial frequencies u, have the same component w, 
so that all visibilities are affected by the same chirp. A theoret- 
ical computation of the mutual coherence between the sparsity 
and sensing dictionaries as a function of t and w naturally proves 
that the coherence decreases when w increases, through the spread 
spectrum phenomenon. Our theoretical relation also shows that the 
coherence may be decreased as close as required to the minimum 
coherence between the real and Fourier spaces through the use of a 
large enough chirp rate w. In light of the theory of compressed sens- 
ing, this suggests some universality of the spread spectrum phe- 
nomenon according to which, for a given sparsity K and a given 
number M of random measurements, the quality of the BP recon- 
struction would not only increase when w increases, but could also 
be rendered independent of the sparsity dictionary for large enough 
chirp rate w. In other words, the BP reconstruction could be made 
as good for signals sparse in a dictionary of Gaussian waveforms 
of any size, as for signals sparse in real space. We produce simu- 
lations of the signals considered, and perform BP reconstructions 
from noisy visibility measurements. Our results confirm the univer- 
sality of the spread spectrum phenomenon relative to the sparsity 
dictionary. 

Let us acknowledge the fact that spread spectrum techniques 
are widely used in telecommunications. In this context, chirp mod- 
ulations are used in particular to render transmitted signals more 
robust to noise. In the context of compressed sensing, spread spec- 
trum techniques using pseudo-random modulations have very re- 
cently been highlighted as a means to enhanc e the signal recon - 
struction and render it stable relative to noise l lNaini et alj|2009h . 
The use of chirp modulations has also been proposed in compressed 
sensing radar to produce a convolution of a sparse signal in the spar- 
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Figure 1. Illustration of notations. The signal probed x(t), with t = s — 
so, extends in any direction on the celestial sphere S 2 identified by unit 
vectors s £ R 3 , around the pointing direction sq £ R 3 . The field of view 
of the radio interferometer is set by the primary beam A(t). The baselines 
also characterizing the interferometer are vectors 6^ £ R 3 with norm | b\ \ . 
The coordinate system R : (O, ei, e.2, £3) in R 3 is illustrated, with £3 
identified with so , e.2 pointing toward north, and ei pointing toward east. 
The components of the direction vectors s on the sphere are denoted by 
(I, m, n), while those of a baseline are denoted by (u, v, w). 

sity basis itself before performing measurements in tha t same basis 
jBaraniuk & Steeghsll2007l; iHerman & Strohmedl2009t) . In this re- 
spect, these techniques are much more related to coded aperture 
techniques dGottesman & Fenimorelll989l ; lMarcia & Willetdl2008t> 
as we ll as to compressed sensing by random convolution dRombergl 
l2008h than to spread spectrum techniques. 

In Section [2] we formulate the interferometric inverse prob- 
lem for image reconstruction on small field of view in the presence 
of a non-negligible and constant component w of the baselines. In 
Section[3] we concisely recall results of the theory of compressed 
sensing regarding the definition of a sensing basis and the accu- 
rate reconstruction of sparse or compressible signals from BP. In 
Section|4] we establish the universality of the spread spectrum phe- 
nomenon for a sparsity dictionary made up of Gaussian waveforms, 
both from theoretical considerations and on the basis of simula- 
tions. We finally conclude in Section|5] 



2 RADIO INTERFEROMETRY 

In this section, we recall the general form of the visibility mea- 
surements and study the approximation of a small field of view and 
baselines with non-negligible component w. We identify in particu- 
lar the corresponding spread spectrum phenomenon in the presence 
of a constant component w. We also pose the corresponding inter- 
ferometric inverse problem for image reconstruction. 

2.1 General visibilities 

In a tracking configuration, all radio telescopes of an interferomet- 
ric array point in the same direction on the unit celestial sphere 
S 2 identified by a unit vector so £ R 3 . Arbitrary directions are 
denoted by unit vectors s £ R 3 . The field of view observed 
is limited by a so-called primary beam A(t), where the vector 
r = s — so £ R 3 identifies directions relative to the pointing 
direction (see Fig.QJ. The size of its angular support is essentially 
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inversely proportional to the size of the dishes of the telescopes. 
We consider a monochromatic signal x(t) with an emission wave- 
length A, and made up of incoherent sources (see Fig. QJ. Also 
considering non-polarized radiation, both the signal and the pri- 
mary beam are scalar square-integrable functions on the sphere, 
x(t), A(t) £ L 2 (S 2 , dfl(r)), where df2(r) stands for the invari- 
ant measure on the sphere. At each instant of observation, each 
telescope pair measures a complex visibility defined as the corre- 
lation between incoming electric fields at the positions of the two 
telescopes. This visibility only depends on the relative position be- 
tween the two telescopes, defined as a baseline. The baseline in 
units of A is denoted by a vector b\ £ R 3 (see Fig.[T|l. Each v isi- 
bility measured reads in general form as dThompson et al .112004b : 

y(b x )= [ Ax{r)e- 2M >-- T dfi(r). (1) 
Js 2 

We consider the conventional right-handed Cartesian coordinate 
system R : (O, ei, e»2, 6:3) in R 3 centered on the Earth. The direc- 
tion £3 identifies with the pointing direction Jo. The directions e.\ 
and e.2 identify the orthogonal plane, parallel to the plane tangent 
to the celestial sphere at the pointing direction, with e.2 pointing 
toward north, and ei pointing toward east. The unit vectors s may 
be identified by their components (I, m, n) in R (see Fig.[TJ. Each 
point is thus obviously identified by the two-dimensional vector 
I £ R 2 with components (I, m) in the plane (O, ei, e.2), while the 
component in direction e-j reads as n(l) = (1 — i| 2 ) 1//2 with the 
norm \l\ = (I 2 + m 2 ) 1//2 identifying the distance to the origin in 
the plane (O, ei, e.2). In particular, so bears by definition the com- 
ponents (0, 0, 1) so that t reads as (I, n(l) — 1). Signal and primary 
beam may also be seen as functions of this two-dimensional vector, 
respectively x(l) and A(l). The invariant measure on the sphere 
reads as dfl(r) = n~ 1 (l)d 2 l, where d 2 Z = dldm stands for the 
canonical invariant measure in the plane (O, ei, en). The compo- 
nents of b\ are standardly denoted by (u, v, w), so that its norm 
reads as \b\\ = (u 2 +v 2 +W 2 ) 1 ' 2 (see Fig.Q~|l. These may also be 
divided into the two-dimensional vector u £ R 2 with component 
(u, v) in the plane (O, ei, £2) and the component w in direction 
e3, i.e. in the pointing direction of the radio interferometer. 

The general form {XJ of the visibilities reads in terms of these 
coordinates as: 

y(u,w)= [ Ax(l)e- 2i7rlu - l+w{n{l) ~ 1)] n- 1 (l)d 2 l, (2) 
Jd 2 

where the integration is performed inside the unit disc D 2 on the 
plane (O, ei, e.2). 

In the course of an observation, for each telescope pair, the 
baseline components u, v, and w all follow a sinusoidal dependence 
in time thanks to the Earth rotation, with specific parameters linked 
to the parameters of observation. The total number of points (it, w) 
probed by all telescope pairs of the array during the observation 
provides some coverage in R 3 characterizing the interferometer. 

2.2 Small field of view and non-negligible w 

Note that for a given telescope array, the baseline component w 
may be seen as a function w — w(u) of the two-dimensional vec- 
tor u with an implicit dependence on the distance between the two 
telescopes considered. Each visibility may thus be seen as a func- 
tion of the two-dimensional vectors u: y(u, w) — y(u). From this 
point of view, the points u probed by all telescope pairs during 
the observation provide some coverage of the plane (O, e 1,6:2) 
also characterizing the interferometer. The two-dimensional vec- 



tor u associated with each telescope pair actually runs over an arc 
of ellipse in this plane. 

In this context, we make the two assumptions of small field of 
view and baselines with non-negligible component w. Firstly, we 
consider a standard interferometer with a primary beam of small 
enough angular support, so that the field of view may be identified 
with a patch of the tangent plane at so. Technically this amounts 
to assume that \l\ 2 <C 1 on the field of view so that the zero order 
expansion n(l) ~ 1 is valid in the invariant measure on the sphere, 
which therefore identifies with that on the plane: dfi(r) = d 2 i. 
The signal and the primary beam hence become scalar square- 
integrable functions in the plane x(l),A(l) £ L 2 (R 2 ,d 2 i). Sec- 
ondly, we consider interferometers with large enough values of the 
w(u), so that the same approximati on on n(V) does not hol d for the 
phase of the imaginary exponential dThompson et al.l2004l) . In this 
case any term discarded in n(l) should be small relative to w~ 1 (u) . 
We assume however that the further constraint on the field of view 
\l\ 4 < w _1 (u) holds for all u, so that the second order approx- 
imation n(l) ~ 1 — |i| 2 /2 is valid in the imaginary exponential. 
Under these conditions, each visibility takes the form: 

y{u) = / Ax (1) e v " 1 e d I. (3) 

Jd 2 

In other words the visibility in u identifies with the value in u of the 
two-dimensional Fourier transform of a complex signal obtained as 
the product of the original planar signal Ax(l) with a linear chirp 
modulation C (w{u)) (\l\) = ilrtu («)l'l 2 . This chirp is characterized 
by a chirp rate w(u) depending on it, and |2| now identifies the 
distance to the origin in the plane of the signal, i.e. to the point 
identified by so. 

Note that in the further approximation that w(u) — for all 
u, the chirp indeed disappears and relation © reduces to the stan- 
dard van Cittert-Zernike theorem. This theorem states that the vis- 
ibilities measured identify with the two-dimensional Fourier trans- 
form of the signal m ultiplied by the primary beam: y(u) — Ax(u) 
dWiaux et alj2009h . 

The fact that baselines may have non-negligible component 
w is at the origin of important challenges from the computational 
point of view. This issue is related to the fact that the explicit com- 
putation of simulated visibilities in $3$ from a signal and the cor- 
responding inverse problem have a large complexity. In the case 
of baselines with negligible component w(u) at each u, the simple 
two-dimensional Fourier transform relation between signal and vis- 
ibilities allows the use of the Fast Fourier Transform (FFT), which 
substantially lightens the computation. This is not possible any- 
more in the case of baselines with non-negligible component w(u) 
as the chirp modulation is characterized by a chirp rate explicitly 
dependent on u, and therefore needs to be imposed separately for 
each visibility. A number of algorithms have been studied in this 
case, among which fac eting algorithms as w ell as the very recent 
w-projection algorithm dCornwell et al.l2008h . In one word this w- 
projection algorithm simply computes a unique FFT, after which 
the visibility at each u is obtained by the evaluation, at this u, of 
the convolution of the Fourier transform of the signal with that of 
the chirp with chirp rate w(u). Considerations on the extension of 
the corresponding convolution kernel in the Fourier plane allows to 
drastically reduce the computational load in practice. These algo- 
rithmic issues are however not our concern here. We only aim at 
discussing how this natural and compulsory chirp modulation may 
drastically enhance the quality of signal reconstruction. 
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2.3 Constant w and spread spectrum phenomenon 

We already acknowledged that, in the course of an observation, the 
two-dimensional vector u associated with each telescope pair runs 
over an arc of ellipse, and the corresponding component w follows 
a sinusoidal evolution. However, the whole baseline distribution in 
K 3 is extremely dependent on the specific configuration of the ra- 
dio telescope array under consideration. Visibilities from various 
interferometers may also be combined, as well as visibilities from 
the same interferometer w ith different pointing d irections through 
the mosaicking technique dThompson et alj|2004l) . From this point 
of view the baseline distributions are rather flexible. 

Relying on this flexibility we assume that all baselines have 
the same component in the pointing direction: w{u) = toQ This 
assumption of constant component w allows us to discard consid- 
erations related to specific interferometers. Moreover, it also allows 
us to study the impact of a chirp modulation on the basis of a large 
range of simulations at light computational load. Under this as- 
sumption all visibilities indeed identify with the values of the two- 
dimensional Fourier transform of a complex signal obtained as the 
product of the original planar signal Ax(l) with the same linear 
chirp C^"'(|Z|) = e"™'*' characterized by the chirp rate w inde- 
pendent of u: 

y (u) = C^Ax (u) . (4) 

Note that the multiplication by the chirp modulation amounts 
to the convolution of the Fourier transform of the chirp with 
that of the original signal multiplied by the primary beam: 
C( w )Ax — Ci w ) * Ax. This convolution inevitably spreads the 
two-dimensional sample power spectrum of the signal multiplied 
by the primary beam, i.e. the square modulus of its Fourier trans- 
form, while preserving its norm. 

In particular, the original signal x(l) and the primary beam 
A(l) can be approximated by band-limited functions on the finite 
field of view set by the primary beam itself. A linear chirp C' w ' 
with chirp rate w is characterized by an instantaneous frequency wl 
at position I. On the finite field of view set by the primary beam it 
is therefore approximately a band-limited function. In each basis 
direction, the band limit of the signal after convolution B^ c Ax ^ 
is the sum of the individual band limits of the original signal mul- 
tiplied by the primary beam and of the chirp modulation 
B(° W >: B«^>a*) = B (o<«>) + B (A*)_ This simple consid . 

eration already quantifies the spread spectrum phenomenon asso- 
ciated with the chirp modulation. Note for completeness that the 
primary beam is by definition limited at much lower frequencies 
than the signal so that it does not significantly affect its band limit: 

2.4 Interferometric inverse problem 

The band-limited functions considered are completely identified by 
their Nyquist-Shannon sampling on a discrete uniform grid of N = 
N 1/2 x N 1/2 points U G R 2 in real space with 1 i N. 
The sampled signal is denoted by a vector x G R^ = {xi = 
x(li)}i^i^N and the primary beam is denoted by the vector A G 

1 Note for illustration that, for a unique interferometer pointing to the north 
celestial pole, each telescope pair would exhibit constant w during obser- 
vation. For particular configurations of telescopes in two east-west linear 
arrays, all telescope pairs with each telescope belonging to a different array 
would produce baselines with identical value of w. 



W N = {Ai = A(li)}i^i^N ■ The chirp is complex and reads as 
the vector C (w) G C N = {C^ w) = C {w) {\U |)W<jv- Because 
of the assumed finite field of view, the functions may equivalently 
be described by their complex Fourier coefficients on a discrete 
uniform grid of TV = N 1 * 12 x 7V 1//2 spatial frequencies Ui with 
1 i ^ N. This grid is limited at some maximum frequency 
defining the band limit. Due to the fact that the chirp is complex, 
the Fourier transform of the product Ax does not bear any 
specific sy mmetry property in the Fourier plane. 

As in IWiauxetalJ ( l2009h . we assume that the spatial frequen- 
cies u probed by all telescope pairs during the observation belong 
to the discrete grid of points m. The Fourier coverage provided by 
the M/2 spatial frequencies probed Ub, with 1^6^ M/2, can 
simply be identified by a binary mask in the Fourier plane equal to 1 
for each spatial frequency probed and otherwise. The visibilities 
measured may be denoted by a vector of M/2 complex Fourier co- 
efficients y G C A///2 = {j/f, = y(ub)}i^i > ^M/2, possibly affected 
by complex noise of astrophysical or instrumental origin, identified 
by the vector n 6 C M ^ 2 = {nt = n(u b )}i^b^M/2- Formally, 
the measured visibilities may equivalently be denoted by a vector 
of M real measures y € R M = {y r }i^ r ^M consisting of the 
real and imaginary parts of the complex measures, affected by the 
corresponding real noise values n G R M = {n r }i^. r ^M- 

In this discrete setting, the Fourier coverage is in general in- 
complete in the sense that the number of real constraints M is 
smaller than the number of unknowns N: M < N. An ill-posed 
inverse problem is thus defined for the reconstruction of the signal 
x from the measured visibilities y: 

y = <t> (u,) a; + n with <J> (u,) = MFC (ul) A, (5) 

where the matrix <J> (u,) G £(M/2)xN identifies the complete 
linear relation between the signal and the visibilities. The ma- 
trix A G R JVxJV = {Aij = Ai5ij}i<:ij<: N is the diago- 
nal matrix implementing the primary beam. The matrix C^ w ' G 
C NxN = {Clf = Cf' 1 ^}!^ is the diagonal ma- 
trix implementing the chirp modulation. The unitary matrix F G 
c nxn _ _ e -2i™ i -x 3 y JV i/2-t i ^.^^ iv implements the dis- 
crete Fourier transform providing the Fourier coefficients. The ma- 
trix M G R( M / 2 ) xJV = {M b:j } 1 ^ b ^ M /2;i^j^N is the rectangular 
binary matrix implementing the mask characterizing the interfer- 
ometer. It contains only one non-zero value on each line, at the 
index of the Fourier coefficient corresponding to each of the spatial 
frequencies probed u. 

In the perspective of the reconstruction of the signal x, rela- 
tion (|5) represents the measurement constraint. We take a statistical 
point of view and consider independent Gaussian noise on each real 
measure y r . Considering a candidate reconstruction x, the residual 
noise reads as n^™' = y — <t>( w 'x. The residual noise level es- 
timator, defined as twice the negative logarithm of the likelihood 
associated with x, reads as 

with n (w) G R A/ = {^'Jkkm. and where °" ( "*' ) stands for 
the standard deviation of the noise component n r . This noise level 
estimator follows a chi-square distribution with Al degrees of free- 
dom. Typically, this estimator should be minimized by the good 
candidate reconstruction. The measurement constraint on the re- 
construction may be defined as a bound \ 2 (x; ^ e 2 , 

with e 2 corresponding to some percentile of the chi-square distri- 
bution. Let us note that the expectation value of the x 2 is equal 
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to its number of degrees of freedom M, while its standard devia- 
tion is (2M) 1//2 . In other words, for a large number of degrees of 
freedom the distribution is extremely peaked around its expectation 
value. This fact is relate d to the well-known phenomenon of the 
concentration of measure l lCandes et al .l2006bh . As a consequence, 
the value e 2 is anyway extremely close to M. 

In this context, many signals may formally satisfy the mea- 
surement constraint. A regularization scheme that encompasses 
enough prior information on the original signal is needed in order 
to find a unique solution. All image reconstruction algorithms will 
differ through the kind of regularization considered. 



Let us recall that the l\ norm of a is defined as ||a||i = 
Y^i=i |Si [- The £2 norm of the residual noise is the standard norm 
of the corresponding vector: ||n||2 = iHiLi l^il 2 ) 1 ^ 2 - I n these 
relations, the notation |o| for a scalar a stands for the complex 
modulus when applied to a complex number and the absolute value 
when applied to a real number. The BP e problem is solved by ap- 
plication of non-linear and iterative convex optimization algorith ms 
JCombettes & Pesquetll2008l ; lvan den Berg & Friedlandej|2008l) . 

Note that the £2 norm term in the BP £ problem is identical to a 
bound on the \ 2 distribution with M degrees of freedom governing 
the noise level estimator. 



3.3 Randomness and incoherence 



3 COMPRESSED SENSING 

In this section, we concisely recall the inverse problem for sparse 
signals considered in the compressed sensing framework. We de- 
scribe the BP minimization problem set for reconstruction and re- 
call the randomness and incoherence properties for a suitable sens- 
ing basis. 



3.1 Inverse problem for sparse signals 

In the framework of compres sed sensin g dCandesetalJl2006allbl 
Candesl 120061 : iDonohd 120061 : lBaraniukll2007l ; iDonoho & Tanned 
20091) the signals probed are firstly assumed to be sparse or com- 
pressible in some basis. Technically, one considers a real signal 
identified by its Nyquist-Shannon sampling, denoted by the vec- 
R^ = {^iji^i^jv. A real orthonormal basis W G 
i^ij}i^i.j^N is also considered, in which the decom- 
= {aij-i^i^iv of the signal is defined by 



tor x G 

m,NxN _ 



position a G 
x = tya. 



(7) 



The signal is said to be sparse or compressible in this basis if it 
only contains a small number K <g N of non-zero or significant 
coefficients a; respectively. 

Secondly, the signal is assumed to be probed by M real linear 
measurements denoted by a vector y G R A/ = {j/rji^r^M in 
some real sensing basis <t> G R MxAr = {<6rj}i<r<M;i^j^iv and 
possibly affected by independent and identically distributed noise 
n G R 



{fir}i^rs;A/. This defines an inverse problem 



y = Qa + n with = *tl> G 



(8) 



where the matrix identifies the sensing basis as seen from the 
sparsity itself. The number M of constraints is typically assumed 
to be smaller than the dimension N of the vector defining the signal, 
so that the inverse problem is ill-posed. 



3.2 BP reconstruction 

The framework proposes the global BP minimization problem for 
the signal recovery. This problem regularizes the originally ill- 
posed inverse problem by an explicit sparsity or compressibility 
prior on the signal. In the presence of noise, the so-called Basis 
Pursuit denoise (BP e ) problem is the minimization of the £\ norm 
||a||i of the coefficients of the signal in the sparsity basis un- 
der a constraint on the £2 norm j | tx 1 1 2 of the residual noise, with 
n = y — Qa: 



Among other approaches jDonoho & Tanne3l2009h . the theory of 
compressed sensing defines the explicit restricted isometry prop- 
erty (RIP) that the matrix = <t>W should satisfy in order 
to allow an accurate r ecovery of sp arse or compressible signals 
dCandes et al]|2006allbl:ICandesll20o"a) . In that regard, the issue of 
the design of the sensing matrix <t> is of course fundamental. The 
theory offers various ways to design suitable sensing matrices, 
showing in particular that a small number of measurements is re- 
quired relative to a naive Nyquist-Shannon sampling: M <C N. 

One can actually show that incoherence of <t> with the sparsity 
or compressibility basis and randomness of the measurements 
will ensure that the RIP is satisfied with overwhelming probability, 
provided that the number of m easurements is large e nough relative 
to the sparsity K considered dCandes et al1l2006bl : ICandesll2006l 
l2008h . In particular, measurements may be performed through a 
uniform random selection of Fourier frequencies. In this case, the 
precise condition for the RIP depends on the degree of incoherence 
between the Fourier basis and the sparsity or compressibility ba- 
sis. The mutual coherence /j, between the orthonormal Fourier and 
sparsity bases may be defined as the maximum complex modulus 
of the scalar product between basis vectors of the two bases. If the 
unit-normed basis vectors corresponding to the lines of F and the 
columns of W are denoted by {fi}i^i^N and {ipj}i^j^N, the 
mutual coherence between the bases reads as: 



M(F,V)= max Jfi-^A 

1 <J z , j <C TV 



(10) 



min llalli subject to \\y — Qalla ^ e. 



(9) 



In other words this mutual coherence identifies with the maximum 
complex modulus of the Fourier coefficient values of the spar- 
sity basis vectors. Note that the inverse of the mutual coherence, 

(F, W), can also be called the mutual incoherence between the 
Fourier and sparsity bases. 

In this context, the RIP is already satisfied for a small number 
of measurements satisfying the constraint 

cM 

K ^ Nfi 2 (F, In 4 N ' 

for some constant c. Under this condition^ the BP e reconstruc- 
tion of signals well approximated by a K sparse signal is shown to 
be accurate and stable relative to noise as well as relative to com- 
pressibility, in the sense of a departure from exact sparsity. Let us 
emphasize the fact that the mutual coherence plays an essential role 
in relation d 1 1 b as for fixed M the sparsity recovered increases with 
the mutual incoherence, as K oc ^i~ 2 (F, ty). 

2 Let us also acknowledge the fact that this bound is not tight. Empiri- 
cal results i Candes & Romberg 2005; Lustig et al. 2007) suggest that ratios 
M/K between 3 and 5 already ensure the expected reconstruction quality. 
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The incoherence is notably maximum between the Fourier ba- 
sis and the real space basis identified by a sparsity matrix = A 
made up of unit spikes: /i(F, A) = N^ 1 ^ 2 . In the continuous limit 
N — > oo the real space basis identifies with a Dirac basis and the 
maximum incoherence is infinite, corresponding to zero coherence: 



lim m(F, A) 



0. 



(12) 



Hence, for a fixed number M of pure Fourier measurements, the 
sparsity recovered K is maximum for signals sparse in real space. 



4 SPREAD SPECTRUM UNIVERSALITY 

In this section, we define simple astrophysical signals sparse in a 
dictionary of Gaussian waveforms, and explicitly identify our sens- 
ing dictionary for radio-interferometric measurements. We com- 
pute the theoretical coherence between the sparsity and sensing 
dictionaries as a function of the chirp rate and of the size of the 
Gaussian waveforms, which suggests the universality of the spread 
spectrum phenomenon relative to the sparsity dictionary. We define 
an observational set up, and describe our simulations and specific 
reconstruction procedures. We finally expose the results of the anal- 
ysis, and comment on the need for future work along these lines. 



4.1 Sparsity and sensing dictionaries 

We consider simple astrophysical signals sparse in a dictionary of 
Gaussian waveforms, all with equal and fixed size given by a stan- 
dard deviation t £ R+. These signals are assumed to be probed by 
radio interferometers with a Gaussian primary beam A^ to> with a 
size identified by a standard deviation to 6 R+. The correspond- 
ing matrix thus reads as A'* '. The value to sets the size the field 
of view of interest, which must naturally be larger than the size of 
the Gaussian waveforms: to > t. We consider a small field of view 
and baselines with a non-negligible constant component w, so that 
the visibilities measured yt = y(ub) take the form (|4). Relying on 
the previous discussion relative to the flexibility of realistic base- 
line distributions, we also assume that the spatial frequencies Ub 
probed arise from a uniform random selection of Fourier frequen- 
cies. As for the assumption of constant w, this allows us to discard 
considerations related to specific interferometers. It also allows us 
to place our discussion in a setting which complies directly with the 
requirement of the theory of compressed sensing for random mea- 
surements. Let us however recall that specific deterministic distri- 
butions of a low number of linear measurements might in fact also 
allow ac curate signal reconstr uction in the context of compressed 
sensing iMatei & Meveill2008h . 

In this context, the interferometric inverse problem $5$ sim- 
ply arises from a uniform random selection of spatial frequencies 
with a sensing dictionary <t> = = MFC (l "' A (io) also 

parametrized by the size to of the primary beam. The sparsity dic- 
tionary identifying Gaussian waveforms of size t may be denoted 
by W = v|/w = ft*) T ne sensing dictionary as seen from the spar- 
sity dictionary itself therefore reads as G^'* '*' = 0("'> t o)r (t) = 
MFC M A (to) r (t) . 

Let us emphasize the fact that the sparsity dictionary is ob- 
viously not an orthogonal basis. Moreover, the sensing dictionary 
does not correspond anymore exactly to a random selection of vec- 
tors in an orthogonal basis. No precise compressed sensing re- 
sult similar to the bound dl It relating sparsity and mutual co- 
herence between such dictionaries was yet obtained. However, in 
the line of first results already proved with redundant dictionaries 



dRauhut et al .11 20081) . one can intuitively conjecture that a bound 
similar to ( Hit holds in our context, still exhibiting the same trade- 
off between sparsity and mutual coherence. In the perspective of 
assessing the BP e reconstruction quality, it is therefore essential 
to understand how the mutual coherence between the sensing and 
sparsity dictionaries depends on the size t of the Gaussian wave- 
forms and on the chirp rate w. 

4.2 Theoretical coherence 

After normalization of the vectors of the sparsity and sensing dic- 
tionaries in £2 norm, a simple analytical computation gives the mu- 
tual coherence between the sensing and sparsity dictionaries as: 



MFC 



(«0a(*o) rW 



2tt 



(13) 



Note that one can formally re-organise the decomposition of the 
matrix Q^ w ' to '^ into modified sensing and sparsity dictionaries, 
respectively * = MF and xj/^o.*) = c( m >A ( ' o) r (t) . In this per- 
spective, the mutual coherence ^(FC^"' A'* ', T'*- 1 ) identifies with 
the maximum complex modulus of the Fourier coefficient values of 
the modified sparsity dictionary vectors, which depend on the chirp 
modulation and on the primary beam. Consequently, the require- 
ment for a lower coherence can be fulfilled by an operation that 
spreads the two-dimensional sample power spectrum of the signal 
in the Fourier plane, while preserving its norm. 

The relation U3\ is valid both for finite N as well as in the 
continuous limit N — > 00. In this continuous limit, analyzing the 
evolution of the coherence as a function of the parameters t, to, and 
w is very enlightening. 

Firstly, we consider finite non-zero values of the size to of 
the primary beam and of the chirp rate w. When the size t of the 
Gaussian waveforms tends to zero, the mutual coherence tends to 



lim 



i/i (fC (u,) A ( ' o) , 



-(t) 



for all w, to G 



(14) 



In the absence of chirp modulation and primary beam, this null 
value is expected from relation l l 1 _t as the modified sparsity dic- 
tionary identifies with the Dirac basis. But we also show that the 
mutual coherence gets to this minimum value independently of the 
chirp rate w in this limit, and less importantly independently of the 
size of the primary beam to. 

Secondly, we consider finite non-zero values of the size to of 
the primary beam and of the size t of the Gaussian waveforms, 
with t < to. In the absence of chirp, i.e. for w = 0, the coherence 
is strictly positive: ^(FC (0) A ( ' o) , r (t) ) = 2tt /(t 2 + t§). The fi- 
nite size t of a Gaussian waveform implies a Gaussian profile of its 
Fourier transform with a standard deviation (2ivt)~ 1 . The larger t, 
the smaller the extent of the two-dimensional sample power spec- 
trum in the Fourier plane, and the larger the coherence. As dis- 
cussed in Section |2~3| in the presence of a chirp, i.e. for w 7^ 0, 
the spectrum is spread by means of the corresponding convolution 
in the Fourier plane, while the norm of the signal multiplied by the 
primary beam is preserved. This spread spectrum phenomenon re- 
duces the coherence which, in light of relation Jilt , should enhance 
the quality of the BP e reconstruction in the context of compressed 
sensing. In the limit of an infinite chirp rate, the coherence also 
tends to its minimum null value independently of the size t of the 
Gaussian waveforms: 



lim Li(FC {w) A (to) ,r (t) ) = for all t, to G 



(15) 
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Figure 3. Real part (left panel) and imaginary part (right panel) of the chirp 
modulation with Wd = 1 assumed in the simulations. The chirp is sampled 
on a grid of 4JV pixels in order to avoid any aliasing artifact. Dark and light 
regions respectively correspond to positive and negative values. 



Gaussian size 


td = 2 


t d =i 


t d =8 


t d = 16 


Coherence for w d = 


0.0518 


0.103 


0.205 


0.400 


Coherence for w d = 1 


0.0517 


0.102 


0.174 


0.151 



Table 1. Values with three significant figures of the mutual coherence 
^(FC^'A^o), rW) between the sparsity and sensing dictionaries, for 
sizes of the Gaussian waveforms identified by td 6 {2, 4, 8, 16}, both in 
the absence of chirp modulation, i.e. for Wd = 0, and for a chirp modulation 
with Wd = 1. 

This limit provides a strong result in the sense that the incoherence 
lost by considering a sparsity dictionary of Gaussian waveforms of 
arbitrary non-zero size t may be completely recovered thanks to a 
chirp modulation with high enough chirp rate w. Still from relation 
dl It . this result suggests the universality of the spread spectrum 
phenomenon according to which the quality of the BP £ reconstruc- 
tion can be rendered independent of the sparsity dictionary for a 
large enough component w of the baselines. 

4.3 Observational set up 

The signals x considered are built as the superposition of A' = 10 
Gaussian waveforms, with random positions, and random central 
value in the interval [0, 1] in some arbitrary intensity units (iu). 
The signals are sampled as images x on a grid of N = 64 x 64 = 
4096 pixels. Fields of view L = L 1/2 x L 1/2 with L 1/2 not 
larger than several degrees of angular opening may be consid- 
ered for the approximation of a planar signal to be valid. A pri- 
mary beam A'* ' with a full width at half maximum (FWHM) 
2to(2 In 2) 1//2 = L 1//2 is also considered, centered on the sig- 
nal and with unit value at the center. In the discrete setting de- 
fined in Section [2~4l the inverse problem is transparent to the pre- 
cise value of the field of view so that we do not need to fix it. 
Observations are considered for M/2 complex visibilities with 
M/2 6 {50, 100, 200, 300, 400, 500, 1000}. The corresponding 
coverages of the spatial frequencies ut, relative to the total number 
of spatial frequencies u, in the Fourier plane up to the accessi- 
ble band limit on the grids B (N ' L) = N 1/2 /2L 1/2 , are roughly 
between 1% and 25%. Instrumental noise is also added as inde- 
pendent identically distributed Gaussian noise. The corresponding 
standard deviation er'" r ', identical for all r with 1 ^ r M, is 
set to (a^)f = 10- 3 (£< FA<fo) *>) 2 , where E< FA(to) *> stands for 
the sample standard deviation of the real and imaginary parts of the 



Fourier transform of the original signal multiplied by the primary 
beam FA'' ' a;. Note that this noise measure is independent of the 
chirp modulation, which preserves the norm of the signal. 

The size t of the Gaussian waveforms may be written in terms 
of a discrete size td as t = tdL 1 ^ 2 /nN 1 ^ 2 . We consider the values 
td 6 {2, 4, 8, 16} such that the Gaussian waveforms are well sam- 
pled on the grid. For td = 2 the approximate band limit B^ A< ^ 
of the signal is around the same value as B (N ' L) , while for t d = 16 
it is much smaller. All values also satisfy the constraint that the size 
of the Gaussian waveforms remains smaller than that of the field of 
view: t < to- Original signal samples multiplied by the primary 
beam are reported in the top panels of Fig. [2] one for each value of 
td considered. The middle panels notably represent, for each value 
of td, the one-dimensional sample power spectrum of the original 
signal multiplied by the primary beam, i.e. the average of the two- 
dimensional sample power spectrum on all spatial frequencies u on 
the same annulus at fixed radial frequency \u\ = (u 2 + v 2 ) 1 ^ 2 . 

The component w may also be written in terms of a discrete 
component Wd as w — WdN 1 ^ 2 / L. We consider two extreme val- 
ues. The case of baselines with negligible component w is iden- 
tified by Wd = 0. The case of baselines with non-negligible and 
constant component w is identified by wa = 1, corresponding to a 
linear chirp modulation with a maximum instantaneous frequency 
wL 1/2 /2 = B {N ' L) , i.e. with an approximate band limit B^™^ 
not much larger than B^ N,L ^ (see Fig. [3). In other words the com- 
ponent it; is a factor 2/L 1 ' 2 larger than the maximum value of the 
components u and v in the plane of the signal, so that for the small 
fields of view assumed the baselines are strongly aligned with the 
pointing direction. One can check that this value of w is consistent 
with the constraint for relations 10 and {4]l. 

Table Q] contains the values of the mutual coherence 
/i(FC (w) A (to) , T (t) ) for the values of t d and w d considered. These 
values are computed numerically from the definition d 10b . A com- 
putation from the theoretical relation dl3t provides the same results, 
up to a relative discrepancy of 10% related to the fact that in our 
setting the primary beam is not completely contained in the field 
view. These numerical values are thus completely in the line of our 
discussion of Section |4~2l Additionally, the one-dimensional sam- 
ple power spectra of the original signals multiplied by the primary 
beam, as well as those of the chirp modulation and of the signals 
spread by the chirp modulation are represented in the middle pan- 
els of the Fig. [2] directly below the corresponding original signal 
for each value of td- These graphs illustrate the spread spectrum 
phenomenon due to the chirp modulation and the reduction of the 
mutual coherence intimately related to it. Up to some normaliza- 
tion the mutual coherence is indeed essentially the maximum value 
of the square root of the one-dimensional sample power spectrum. 

4.4 Simulations and BP E reconstruction procedures 

A number of 30 simulations is generated for each value of td and 
Wd considered. The visibilities are simulated^ and the signals are 
reconstructed through the BP E problem, which is solved by convex 

3 In practice the grids considered allow the sampling at Nyquist-Shannon 
rate for signals with band limits up to B^ N ' L ' in each direction. This is not 
the case for the original signals after the chirp modulation. In order to avoid 
artificial aliasing effects in the discrete Fourier transform at the level of the 
frequencies probed, it is essential to introduce an operator increasing the 
resolution of the grids, by zero-padding in the Fourier plane, between the 
sparsity and sensing matrices. 
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Figure 2. From left to right, maps and graphs are associated with increasing values of the size td £ {2, 4, 8, 16} of the Gaussian waveforms constituting 
the sparsity dictionaries P*' in which the astrophysical signals considered are defined. The top panels represent original signal samples multiplied by the 
primary beam. Dark and light regions respectively correspond to large and small intensity values. The middle panels represent the one-dimensional sample 
power spectra in the square of the intensity units (iu 2 ) and as a function of the radial frequency i 1 / 2 ^ with L 1,/2 |it| ^ 32\/2 for the original 
signals multiplied by the primary beam in the above panel (continuous black curve), for the chirp modulation (continuous blue curve), and for the signals 
spread by the chirp modulation (continuous red curve). The bottom panels represent the SNR of the reconstructions multiplied by the primary beam as a 
function of the coverage identified by the number of complex visibilities M/2 £ {50, 100, 200, 300, 400, 500, 1000}. The curves correspond to the ABP e 
reconstructions (dot-dashed black curve), the ABP e l reconstructions (dot-dashed red curve), the rBP £ reconstructions (continuous black curve), and the 
rBP c l reconstructions (continuous red curve). The latter illustrate the spread spectrum universality relative to the sparsity dictionary. Each curve precisely 
represents the mean SNR over the 30 simulations, and the vertical lines identify the error at 1 standard deviation. 



optimizationQThe quality of reconstruction is analyzed in terms of 
the signal-to-noise ratio of the reconstructions multiplied by the pri- 
mary beam SNR = -20 log 10 (E (A(to> (<»-«)) /s( A<t ° 5 »0), whe re 
E' A< stands for the sample standard deviation of the original 
signal A ( ' o) x, and E (A for that of the discrepancy signal 

A (t0, (a; - sb). 

We actually compare two different settings for the reconstruc- 
tions. In the first setting, called rBP e , we assume that the Gaussian 
waveform dictionary with appropriate t is known, and we use it ex- 

4 The algorithm is based on the Douglas-Rachford splitting method 
(Combettes & Pesquet 2008) in the framework of proximal operator the- 
ory (Moreau 1962). Under our constant w assumption, the computation 
complexity is driven at each iteration by the complexity of the FFT, i.e. 
0(N log N). For the small value of N considered the reconstruction typi- 
cally takes less than 1 minute on a single standard processor. The algorithm 
therefore easily scales to much larger values of N. 



plicitly as sparsity dictionary: = P*'. As a consequence, the 
TBP e problem deals with the best possible sparsity value K — 10. 
The reconstructions in the absence (wd = 0) and in the pres- 
ence (wd = 1) of the chirp modulation are respectively denoted 
by rBP e and TBP e l. It is the precise setting in which we just 
brought up the spread spectrum phenomenon and its universality 
on the basis of considerations relative to the mutual coherence be- 
tween the sensing and sparsity dictionaries. In the second setting, 
called ABP E , we assume that the sparsity dictionary is the real 
space basis: W = A. As a consequence, the ABP e problem deals 
with the best possible coherence value /i(F, A) = Af~ 1//2 . How- 
ever, the sparsity computed in real space increases drastically with 
the size of the Gaussian waveforms, suggesting that the reconstruc- 
tion quality should clearly decrease when the Gaussian size in- 
creases. The reconstructions in the absence (wd = 0) and in the 
presence (wd = 1) of the chirp modulation are respectively de- 
noted by ABPeO and ABPel. But the mutual coherence here re- 
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mains unaffected by the chirp modulation, so that this modulation 
should fail to enhance the reconstruction quality in this case. 

This analysis structure is strongly suggested by our interest in 
understanding the behaviour of the standard reconstruction algo- 
rithms used in radio interferometry. On the one hand, the standard 
CLEA N algorithm is essentially a Matchin g Pursuit (MP) algo- 
rithm dMallat & Zhan3ll993l : lMallatlll998l) which works by iter- 
ative removal of the so-called dirty beam, i.e. the inverse Fourier 
transform of the mask, in real space dH6gbordl97llSchwardl978l ; 
iThompson et alj [2004"). It is also known that CLEAN provides re- 
constructi on qualities very similar to what we just called th e ABP £ 
approach jMarsh & Richardson|[l987l ; IWiaux et al.ll2009h . On the 
other hand, multi-scale versions of CLEAN can in principle ac- 
count for t he fact that the si gnal may be sparser in some multi-scale 
dictionary JComwelj|2008h . Assuming that the signals considered 
have a very sparse expansion in such dictionaries, the correspond- 
ing performances would probably be very close to that of our rBP E 
approach, provided that the equivalence between MP and BP still 
holds in a multi-scale setting. 

4.5 Results 

The results of the analysis are reported in the bottom panels of Fig. 
[2] The curves represent the mean SNR over the 30 simulations, and 
the vertical lines identify the error at 1 standard deviation. 

Firstly, for each size td of the Gaussian waveforms and each 
reconstruction procedure, the SNR obviously benefits from an in- 
crease in the number M/2 of visibilities measured, corresponding 
to an increase in the information directly available on the signal. 

Secondly, in the absence of chirp it clearly appears that, for 
each size td of the Gaussian waveforms and each number of visi- 
bilities measured M/2, the TBP E reconstruction exhibits a signif- 
icantly better SNR than the ABP e reconstruction. This suggests 
that for the signals considered it is better to optimize the sparsity 
by accounting for the proper dictionary, than to optimize the mu- 
tual coherence by formally postulating that the signal lives in real 
space. It also appears that, for each number of visibilities measured 
M/2, the SNR of both kinds of reconstructions significantly de- 
grades when the size td of the Gaussian waveforms increases. This 
behaviour is due to the fact that either the mutual coherence, in the 
case of rBP E 0, or the sparsity, in the case of ABP e 0, get farther 
from their optimal values. 

Thirdly, the SNR of the ABP E and ABP e l reconstructions 
remain undistinguishable from one another at 1 standard deviation, 
independently of the size td of the Gaussian waveforms and of the 
number of visibilities measured M/2. This illustrates the fact that 
the mutual coherence is already optimal in the absence of chirp 
modulation, so that any chirp modulation will fail to enhance the 
reconstruction quality. Let us also emphasize that we have imple- 
mented the standard CLEAN algorithm for comparison with this 
ABP £ setting. For each value of td and M and both for Wd — 
and Wd = 1, the corresponding SNR of reconstruction (not shown 
in Fig. |2]( remains, as expected, undistinguishable from both the 
ABP e and ABP e l reconstructions at 1 standard deviation. 

Fourthly, for any number of visibilities measured M/2 and 
for large enough size td of the Gaussian waveforms, the SNR of 
the rBP e l is significantly larger than that of the TBP e recon- 
struction. This is the spread spectrum phenomenon related to the 
reduction of the mutual coherence in the presence of the chirp mod- 
ulation. Moreover, the SNR of the rBP e 1 reconstruction is essen- 
tially independent of the sparsity dictionary identified by td- This 
supports very strongly the principle of universality of the spread 



spectrum phenomenon relative to the sparsity dictionary, in perfect 
agreement with our theoretical considerations. The reconstruction 
quality also appears to be more stable around the mean SNR values 
in the presence of the chirp modulation. 

Finally, let us note that our pure considerations on sparsity and 
mutual coherence led to relation (15) , which suggests the spread 
spectrum universality and associated optimal reconstructions in 
the rBP e setting in the limit of an infinite chirp rate. In practice 
though, the SNR of a TBP e reconstruction should saturate at some 
finite value, and progressively degrade above this value, due to a 
leakage phenomenon. The larger Wd, the larger the spreading of the 
spectrum of the signal. But the extent of the Fourier coverages con- 
sidered is limited by the band limit B^ N ' L K Consequently, when 
the band limit of the signal after chirp modulation significantly ex- 
ceeds the band limit where the visibility distributions are defined, a 
significant part of the energy of the signal remains unprobed. 

Simulations and reconstructions have actually been performed 
for a range of chirp modulations with chirp rates between u>d = 
and Wd — 1, as well as for a higher chirp rate Wd — 1.5, in both 
the TBP 6 setting and the ABP 6 setting. In the TBP 6 setting, the 
SNR of reconstruction (not shown in Fig. |2]( undergoes a natural 
continuous increase for values Wd ^ 1 from the rBP e curve to 
the rBP e l curve, confirming the spread spectrum phenomenon for 
all values of td and M considered. The saturation of the SNR oc- 
curs around Wd = 1, and its degradation is for example already 
significant for a chirp rate Wd = 1.5, for td = 2 and all values 
of M considered. In the ABP E setting, the SNR of reconstruction 
(not shown in Fig.[2j exhibits no evolution for values Wd ^ 1, in- 
dependently of the values of td and M considered, confirming the 
fact that the chirp modulation has no impact on the mutual coher- 
ence. However, the leakage phenomenon obviously also affects the 
reconstructions. The corresponding degradation is observed in the 
same conditions as for the rBP e setting. 

4.6 Comments 

A huge amount of work still needs to be envisaged for analyzing 
the effect of baselines with non-negligible component w in radio 
interferometry. We comment here on three important points. 

Firstly, all our results should be confirmed for various levels of 
instrumental noise before stronger conclusions are drawn. In partic- 
ular, possible implications of the spreading of the signal energy on 
all spatial frequencies probed due to the chirp modulation should 
be studied as a function of the noise level on each visibility. 

Secondly, we assumed perfect knowledge of a sparsity dic- 
tionary made up of simple Gaussian waveforms. A large range of 
multi-scale dictionaries, notably wavelet frames, may be used in 
which a large variety of natural signals are known to be sparse 
or compressible. Let us however acknowledge the fact that a non- 
optimal choice of a sparsity dictionary will of course have an effect 
on the sparsity and possibly degrade the reconstruction. In this per- 
spective, further analyses should be performed in order to assess 
the suitability of specific dictionaries. 

For signals with sparse or compressible gradients, a TV norm 
may be substituted for the l\ norm of the coefficients in the sparsity 
dictionary, in the very definition of the minimization problem. The 
TV norm of a signal is simply defined as the i\ norm of the mag- 
nitude of its gradient dRudin et al J 1 1992h . A theoretical result of 
exact reconstruction holds for such TV norm minimization prob- 
lems in the case of Fourier measureme nts of signals with ex actly 
sparse gradients in the absence of noise jCandes et al1l2006al) . But 
no proof of stability relative to noise and compressibility exists. 



© 2009 RAS, MNRAS OOO-ITHTol 



10 Wiaux et al. 



Such minimization is also accessible t hrough an iterative schem e 
from convex optimization algorithms l lCandes & Rombergll2005l) . 
Even though our signals would not primarily be thought to have 
very sparse gradients, the Gaussian waveforms have in practice 
well defined contours. Preliminary reconstruction results using this 
scheme actually show promising reconstruction qualities. 

Thirdly, further analyses should also be performed in order to 
understand the effect of the chirp modulation for realistic distri- 
butions of the baseline components (u, v, w), and in particular for 
non-constant component w. However, let us already note from the 
bottom panels of Fig.[2]that, for each size td of the Gaussian wave- 
forms, the SNR of the TBPjl reconstruction rises very steeply 
with the number of visibilities measured M/2, before saturating 
around some multiple of the sparsity K = 10. This suggests a 
selection procedure in cases where the total number of visibilities 
measured is very large relative to the sparsity considered. If a suit- 
able fraction of visibilities is associated with the suitable coverage 
of frequencies it;, in the Fourier plane and with a suitable identical 
value of w, only these visibilities might be retained in the problem. 
In such a case, our present considerations could apply directly. 

Let us also recall that the value Wd — 1 requires a strong align- 
ment of the baselines with the pointing direction for small fields of 
view, possibly corresponding to an unrealistically large component 
w. In this respect, the TBP E 1 reconstruction results are asymptotic 
values. However, a given baseline component w will actually corre- 
spond to a larger Wd on a larger field of view L and for a given band 
limit B I ' N ' L ' > , so that Wa = 1 will become more realistic. This is ac- 
tually the exact opposite argument to the one used for the definition 
of faceting algorithms, which decompose a given field of view on 
subfields where the effect of w is negligible. In that regard, the ex- 
tension of our res ults on wide fields of view on the celestial spher e 
will be essential l lCornwell et ai]|2008l : iMcEwen & Scaifell2008l) . 
notably with regard to forthcoming radio interferometers su ch as 
the Square Kilometer Array (SKA) JCarilli & Rawling s 2004)0 



5 CONCLUSION 

We have focused our attention on radio interferometers with small 
field of view and baselines with non-negligible and constant com- 
ponent in the pointing direction, for which a linear chirp modula- 
tion affects the astrophysical signals probed. Considering simple 
sparse signals made up of Gaussian waveforms, we have discussed 
the sensitivity of imaging techniques relative to the sparsity dictio- 
nary in the context of the theory of compressed sensing. A theoret- 
ical computation of the mutual coherence between the sparsity and 
sensing dictionaries, as well as the results of our numerical simula- 
tions, suggest the universality of the spread spectrum phenomenon 
relative to the sparsity dictionary, in terms of the achievable quality 
of reconstruction through the BP e problem. 
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